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Abstract 

o 

CN ' Motivated by applications in energy-efficient scheduling in data centers, Khuller, Li, and Saha in- 

^ . troduced the machine activation problem as a generalization of the classical optimization problems of 

minimum set cover and minimum makespan scheduling on parallel machines. In this problem, a set of n 
jobs have to be distributed among a set of ni (unrelated) machines, given the processing time of each job 
on each machine. Additionally, each machine incurs a startup cost if at least one job is assigned to it. The 
P^ . goal is to produce a schedule of minimum total startup cost subject to a constraint L on its makespan. 

While Khuller et al considered the offline version of this problem, a typical scenario in scheduling is 
one where jobs arrive online and have to be assigned to a machine immediately on arrival. We give an 
{0{\og{mn) logm), (9(logOT))-competitive randomized online algorithm for this problem, i.e. the sched- 
ule produced by our algorithm has a makespan of (9(Llogm) with high probability, and a total expected 
C/3 ' startup cost of (9(log(m«)logm) times that of an optimal offline schedule with makespan L. Our algo- 

^ rithm is almost optimal since it follows from previous results that the two approximation factors cannot 

be improved to o(log»dog«) (under standard complexity assumptions) and o{\ogm) respectively. 

Our algorithms use the online primal dual framework introduced by Alon et al for the onUne set cover 
problem, and subsequently developed further by Buchbinder, Naor and co-authors in various papers. To 
the best of our knowledge, all previous applications of this framework have been to linear programs 
\^ ' (LPs) with either packing or covering constraints. One novelty of our application is that we use this 

'sj" . framework for a mixed LP that has both covering and packing constraints. We combine the packing 

P^ ' constraint with the objective function to design a potential function on the machines that is exponential 

f^ . in the current load of the machine and linear in the cost of the machine. Then, we create a dynamic 

CN ' order of machines based on this potential function and assign larger fractions of the job to machines 

that appear earlier in this order This allocation is somewhat unusual in that the increase in load on a 
machine is inverse in the value of this potential function itself, i.e. inverse exponential in the current load 
J^ . on the machine. Finally, we show that we can round this fractional solution online using a randomized 

r\ • algorithm. We hope that the algorithmic techniques developed in this paper to simultaneously handle 

C^ . packing and covering constraints will be useful for solving other online optimization problems as well. 
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1 Introduction 

In recent times, the emergence and widespread use of large-scale data centers with massive power re- 
quirements has elevated the problem of energy-efficient scheduling to one of paramount importance (see 
e.g. IS). A natural strategy for achieving energy savings is that of partial shutdown, i.e. only a subset of 
machines/processors are active at any point of time. This immediately leads to the following scheduling 
question: which set of machines should be activated to serve a given set of jobs? Note that such a schedule 
must address twin objectives: 

• The total cost (e.g. in terms of energy consumption) of all machines used in the schedule must be 
small (thereby achieving energy efficiency). 



The sum of processing times of all jobs assigned to any machine must be small (thereby satisfying 
throughput requirements). 



• 



Motivated by this application, Khuller, Li, and Saha lUOl introduced the machine activation problem that 
involves scheduling jobs to machines so as to minimize the cost of the machines used in the schedule, while 
ensuring that the "load" on any machine is small. Observe that treating each of these objectives individ- 
ually leads to classical problems in combinatorial optimization, namely minimum set cover and minimum 
makespan scheduling on parallel machines, that have been extensively studied over the last thirty years. The 
novelty of the algorithm proposed in lITOl for the machine activation problem lies in being able to handle 
both objectives simultaneously. 

More formally, let M be a set of m machines and / be a set of n jobs, where the processing time of job 

i on machine / is pij > 0. Further, machine / has startup cost Cj. A schedule is defined as an assignment 

is) 
S : J ^ M of jobs to machines; we denote the set of jobs assigned to machine / in schedule S by J- . The 

(s) 
set of active machines M^ in schedule S are the machines to which at least one job has been assigned, i.e. 

(S) (S) (S) 

M^ = {i £ M : J- / 0}, and the cost of schedule S is defined as Y,j^{s) Cf. The load i) on machine / is 

(s) (s) 

the sum of processing times of all jobs assigned to machine /, i.e. i) = T" . js) Pa, and the makespan iinax 

(s) (s) 

is the maximum load on a machine, i.e. i)-ni^ = max/g^^) • (Often, we will drop the superscript (5) in the 

above notation if the schedule is clear from the context.) The objective of the machine activation problem 

is to obtain a schedule of minimum cost, subject to the constraint that its makespan is at most some given 

value L. 

In real-life scheduling tasks, the set of jobs is often not known in advance. This has motivated extensive 

algorithmic research in online scheduling problems, where the set of machines are available offline but the 

jobs appear online and have to be scheduled to a machine when they arrive. A natural and important question 

left open in ifTOl was to obtain an algorithm for the machine activation problem in the online model. Here, 

the set of machines, their individual startup costs, and the budget on the total startup cost of the machines 

activated by the schedule are known offline, but the jobs amve online. The processing time of a job on each 

machine is also revealed on arrival of the job. The goal is to assign the arriving job to a machine such that 

the cost of the resulting schedule is minimized subject to the constraint that its makespan is at most L. We 

call this the online machine activation problem. 

Our Contributions. Our main contribution is a randomized online algorithm for the machine activation 
problem with a bicriteria competitive ratio of (0(log(mn)logm),0(logm)): suppose an offline optimal 
schedule for an instance of our problem has cost B and makespan at most L; then our online algorithm 
produces a schedule of expected cost 0(Slog(m?i)logm) and makespan O(Llogm) with high probability. 
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Theorem 1. There is a randomized online algorithm for the machine activation problem that has a bicriteria 
competitive ratio o/(0(log(wi«)logm),0(logm)). 

In the minimum set cover problem, we are given a collection of subsets defined on a universe of elements. 
The goal is to select a minimum cost sub-collection such that every element of the universe is in at least one 
selected subset. If the elements appear online and the current selection of subsets must cover every element 
that has appeared thus fai^, a lower bound of Q (log ni log n) is known ifTTI for m sets and n elements, under 
standard complexity assumptions. Since the set cover problem is a special case of the machine activation 
problem where the limit L on the makespan of the schedule is oo, the competitive ratio in the cost of the 
schedule for any online algorithm for the machine activation problem must be ^(logmlogn). 

On the other hand, the (minimum makespan) scheduling problem for unrelated pai^allel machines is 
defined as that of distributing n jobs among m machines so as to minimize the makespan of the schedule 
(machines do not have cost). It can be shown using standard techniques that an online algorithm that pro- 
duces a schedule of makespan at most aL for the online machine activation problem can be used to obtain 
an 0(a) -competitive algorithm for the online scheduling problem on unrelated parallel machines. It is well- 
known m that the competitive ratio of any algorithm for the latter problem is Q.{\ogm); therefore, this lower 
bound also holds for the online machine activation problem. 

Theorem 2. No algorithm for the online machine activation problem can have a competitive ratio of 
o(logm) in the makespan. Further, under standard complexity assumptions, no algorithm for the online 
machine activation problem can have an approximation factor o/o(logmlogn) in the cost of the schedule. 



Our Techniques. Our algorithm draws inspiration from the techniques used to solve the online versions of 
the set cover problem and the scheduling problem for unrelated pai^allel machines. So, let us first summarize 
the key ideas involved in these two algorithms. For the latter problem, Aspnes et al 13 gave an elegant 
solution based on the following exponential potential function: if the current load on machine / is Ij, then 
its potential is a'^' for some constant a. The algorithm assigns the arriving job to a machine that suffers the 
minimum increase of potential, i.e. to machine / = argmin,eM(fl''^'^^'^ — a^'). Observe that if all processing 
times are scaled down sufficiently and a is small enough, then a^'^P'J — a^' ~ a^'{a — l)pij, i.e. the increase 
in potential function is linear in the processing time but exponential in the current load on the machine. 
Therefore, the algorithm favors lightly loaded machines in preference to those offering low processing times. 
It was shown in |2j that this strong "bias" for lightly loaded machines ensures that the sum of the potentials 
of all machines is at most 0{m) times that in an optimal offline schedule, thereby leading to a competitive 
ratio of O(logm) on the makespan of the schedule. 

For the online set cover problem, Alon et al fl] introduced a two-phase online primal dual framework 
that works as follows. In the first phase, the goal is to obtain a feasible fractional solution to the online 
instance of the problem. In each step of this phase, in response to a new constraint (i.e. a new element in the 
online set cover problem) that arrives online, the fractional solution is updated to preserve feasibility while 
ensuring that the cost incurred can be accounted for by a suitably updated dual solutionis In the second 
phase, the fractional solution is rounded online to obtain an integer solution. It is important to note that 
while the two phases are presented sequentially for clarity, the algorithm needs to operate both phases (the 
fractional updates followed by the rounding) in response to response to the arrival of a new constraint. These 
two phases aim to notionally distinguish between the information-theoretic aspect of the online problem and 



'The dual was not used explicitly in Alon et aPs original analysis. In fact, we will also not use the dual explicitly even though 
our algorithm can also be analyzed via the dual. 



the computational aspect of rounding a fractional solution. In fact, Alon et al showed for m sets and n 
elements, this two-phase framework can be used to design an algorithm that has a competitive ratio of 
O(logmlogn), where the O(logm) factor arises in the first phase due to information- theoretic limitations 
of the online algorithm, and the OiXogn) factor arises in the second phase due to computational limitations 
encapsulated by the integrality gap of the linear program (LP) for set cover. 

Minimize LreM CfJC; subject to 

Y^Pijyij < JCiL V/eM (1) 

y,j < X, VieM, jeJ (2) 

Y^y.j > 1 V;G/ (3) 

Xi e {0,1} V/eM (4) 

ytj e {0,1} \^ieM,jeJ (5) 

Figure 1: The integer scheduling linear program (or ISLP). In the fractional scheduling linear program (or FSLP), 
Eqns.|4]and|5]are relaxed to < x, < 1 for all machines / G M, and < yjj < 1 for all machines / G M and jobs j G J, 
respectively. 

This two-phase primal dual framework has since been extensively used for various online problems (see 
||71 for a survey); our algorithm also uses this framework. However, to the best of our knowledge, whereas 
all previous applications of the framework have been to LPs that have exclusively covering or packing 
constraints, we apply the framework to a mixed LP. Consider the integer LP fomiulation of our problem 
given in Fig. [I](we call this the integer scheduling LP or ISLP). The variable Xi is 1 iff machine / is active, 
and jij is 1 iff job j is assigned to machine /. In the fractional relaxation (which we call the fractional 
scheduling LP or FSLP), these variables are constrained to be in the range [0, 1] instead of Eqns. |4] and |5] 
Note that we have both covering (Eqn. O and packing (Eqn. [T]) constraints in the ISLP/FSLP 

We interpret the onUne set cover algorithm as one that maintains a bound on a potential function that is 
linear in the cost of the active machines, whereas the online scheduling algorithm translates its objective of 
minimizing makespan into maintaining a bound on the value of a potential function that is exponential in the 
load on each machine. We design a potential function that combines both objectives: it is linear- in the startup 
cost and exponential in the load on a machine. Our fractional algorithm preferentially assigns (fractions of) 
jobs to machines in a way that leads to a small increase in this potential. It should be noted that even if we 
use this potential function, we cannot afford to simply assign the entire job to the machine that would suffer 
the minimum increase in potential — such a greedy strategy can be easily shown to fail even for the special 
case of the online set cover problem. Instead, our algorithm creates a dynamic list of machines in increasing 
order of its change in potential if the current job were assigned to it, and then assigns larger fractions of the 
job to machines that appear earlier in the order. This allocation is also somewhat unusual in that the increase 
in load on a machine is inverse in the value of this potential function itself, i.e. inverse exponential in the 
cun^ent load on the machine. For some of the machines in the order, this might also involve increasing the 
fraction to which the machine is active, i.e. increasing the value of ;c,-, if the assignment violates Eqn.[I]or|2] 
The analysis of the fractional algorithm involves proving a bound on the value of the cumulative potential 
function over all the machines. Depending on the behavior of a fixed optimal offline solution (which is 
unknown to the algorithm and is used only for analysis), we classify jobs into thr^ee different categories, and 
prove a bound on the total increase in potential due to jobs in each individual category using three different 
techniques: 



• The increase of potential for jobs in category 1 is charged globally to the offline optimal solution using 
a primal dual argument (without introducing the dual solution explicitly) similar in spirit to that used 
by Alon et al in HI for the online set cover problem. 

• We give a bound on the increase in potential for each individual job in category 2 by showing that the 
ratio of increase in potential to the fraction of job assigned is bounded. 

• We give a global bound on the increase of potential for jobs in category 3 by using a recursive ar- 
gument which is similar- in spirit to the one used in the online scheduling algorithm for unrelated 
machines by Aspnes et al^. 

The novelty of our analysis lies in being able to seamlessly combine, and non-trivially extend, the disparate 
techniques from lU and Q. Ultimately, we show that the potential of the fractional schedule produced by 
our algorithm is O(mlogm), where the cost and makespan of the offline optimal solution are respectively 
Q.{m) and 1 by a suitable initial scaling. We hope that the algorithmic techniques developed in this paper to 
simultaneously handle packing and covering constraints will be useful for solving other online optimization 
problems that can be expressed as mixed LPs as well. 

In the second phase of the algorithm, we use an online randomized rounding scheme to obtain an integer 
solution. To ensure that the expected cost of the integer schedule is bounded by that of the fractional 
schedule, each machine / is activated with probability proportional to x,. This is implemented online using 
standai^d techniques. The more challenging aspect of the rounding is the actual scheduling of jobs to active 
machines. The natural approach would be to assign job j to machine / with probability ytj. Translated to 
conditional probabilities, this implies that job j should be assigned to machine / with probability Zij =yij/xij 
if machine / is active, where x;y is the value of x, at the end of the update to the fractional solution for job 
j. This immediately implies that the expected load on a machine in the integer schedule is at most that in 
the fractional schedule. However, our goal is to obtain a bound on the makespan of the integer solution; 
in fact, since the events of jobs being assigned to a fixed machine are positively correlated, a bound on the 
expected load does not immediately yield a concentration bound on the load. Instead, we show that even if 
job i were to be assigned to machine / unconditionally with probability z,j, the expected load on machine / 
given by Y^jeJ^ij would be small. This overcomes the problem of positive correlation mentioned above since 
the events are no longer conditioned on machine / being active. We now derive concentration bounds on the 
load on a machine, which translates to a bound on the makespan of the integer schedule thereby proving 
Theorem [U 

Previous Work. Many variants of the machine scheduling (or load balancing) problem have been exten- 
sively studied in the literature. Perhaps the most celebrated result in scheduling theory is a 2-approximation 
for the offline minimum makespan scheduling problem for unrelated machines due to Lenstra, Shmoys and 
Tardos llT2l . which was later simplified by Shmoys and Tardos llT6l . Rather surprisingly, this algorithm 
continues to offer the best competitive ratio for this problem (and even for several natural special cases such 
as the restricted assignment problem) even after more than two decades of research. In the online setting, 
Graham ||8l|9l showed that the natural greedy heuristic achieves a competitive ratio of 2 — \/m for m iden- 
tical machines. The competitive ratio of this problem has been subsequently improved in a series of results 
(see e.g. HI and subsequent improvements). For the more general restricted assignment problem where the 
processing time of each job j on any machine is either some value pj or oo, an online algorithm having com- 
petitive ratio O(logm) was designed by Azar, Naor and Rom |!4l. This algorithm was later generalized to 
the unrelated machines scenario by Aspnes et al IS with the same competitive ratio. Various other models 



and objectives have been considered for the load balancing problem; for a comprehensive survey, see |l3] 
and ifTSl . In particular, the machine activation problem was introduced by Khuller et al in ifTOl . where they 
gave an 0(2(1 + 1/£)(1 +ln(?i/Of r)),2 + £)-approximation algorithm for any £ > 0. Recently, this result 
was extended by Khuller and Li |[T3l to a more general set of cost functions. 
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2 The Fractional Algorithm 



(6) 

(7) 
(8) 

(9) 

< yij < 1 \/ieM,jeJ (10) 

Figure 2: The relaxed fractional scheduling linear program (or RFSLP). Eqn|6]is enforced only for partially active 
machines /. (Note that for inactive machines, Eqns.|6]and|7]are identical.) 

In this section, we will describe the online updates to the fractional solution to maintain feasibility for 
FSLP on receiving a new job j. This involves updating the values of yij (denoting the fraction of job j 
assigned to machine /) so as to satisfy Eqn. [3l and coixesponding updates to the values of x, if Eqns. [T]or 
|2]is violated. In fact, we relax the constraints in FSLP in two ways. Let machine / be said to be inactive, 
partially active or fully active depending on whether x,- = 0, < x, < 1 or Xj = 1 respectively. First, for 
technical reasons, we relax Eqns. [T]and|2]to Eqns. |6] and |7] respectively (see Fig. [2]l. Further, we enforce 
Eqn.[T]only if x, < 1, i.e. if machine / is not fully active. The load on a fully active machine will be bounded 
sepai^ately in the analysis. We call this the relaxed fractional scheduling LP or RFSLP. 

Before describing these updates, let us set up some conventions that we will use throughout the paper. 
We divide all processing times by L at the outset; this allows us to assume that the makespan of the optimal 
solution is 1. We also assume that we know the value a of the optimal offline (integer) solution, i.e. the 
minimum startup cost of an offline assignment of jobs to machines that has makespan at most 1. This is 
also without loss of generality because we can guess the value of the optimal solution, doubling our guess 
whenever the current algorithmic solution exceeds the cost bounds that we are going to prove (thereby 
implying that our current guess is too small). In the following discussion, it is sufficient to know the value 
of a up to a multiplicative factor of 2, but for simplicity of presentation, we will assume that we know it 
exactly. 

Our algorithm uses the value of n, which is the total number of jobs that arrive online. If this value is 
not known offline, each job estimates n by assuming that it is the last job. We can show that using such 
estimates for n incurs a small additive factor of 0(loglog«) in the makespan, and an additive factor of 
0{lognlog{mn)) in the cost of the schedule. However, for simplicity, the rest of the paper assumes that the 
value of n is known offline. 

We define the virtual cost of job j on machine / as 



wU) 



cia^' ^pij, if machine / is fully active, i.e. X,- = 1 
CiPij, otherwise 



where a is a constant that we will fix later. (Recall that £i represents the load on machine /, i.e. £{ = 
Y^jeJPijyij-^ Let M{j) denote an ordering of machines in non-decreasing order of virtual cost r],(j) for job 
j. Let P{j) denote the maximal prefix of M{j) such that Y,ieP{j)^i < 1- (Note that P{j) may be empty.) If 
P{j) j^ M (j) , then k{j) denotes the first machine in M(j) that is not in ^(7); k{j) is undefined if P(7') =M{j). 
If Xi is increased to x, + Ajc, for a partially active machine /, then we say that the effective capacity created 
by this increase for job j is min{2xi,6Axj /pjj). Note that the effective capacity created by an increase in Xj 
is a feasible increase in the value of Vjj independent of the current load on machine /. 

2.1 The Algorithm 

The algorithm has two phases — an offline pre-processing phase, and an online phase that (fractionally) 
schedules the arriving jobs. 

Pre-processing. We multiply the startup cost of every machine by a/m, and discard machines with startup 
cost greater than m (after the scaling) at the outset. Further, for every machine / whose startup cost is at most 
1 , we increase its cost to 1 , and initialize x, to 1 . For all other machines with 1 < c, < m, we initialize x, to 
l/m. At the end of the pre-processing phase, we have the following properties: 

• The cost of an optimal solution is between m and 2m. 

• The cost of every machine is between 1 and m. 

• Every machine whose cost is 1 is fully active; all other machines have Xj = l/m. 

Online Algorithm. Suppose job j anives online. We increase yijS using the following mles repeatedly 
until Eqn.[8]is satisfied. 

• Type A: k{j) is undefined (i.e. P{j) = M{j)) or X;t(y) < 1 (i-c- machine k{j) is not fully active). We 

increase x, to min(x;(l + l/c,?i), 1) for each machine / G P{i) (and also for / = k{i) if it is defined), 
and correspondingly increase j,y by the effective capacity created in each machine. 

• Type B: x,t(;) = 1 (i-c machine k{j) is fully active). We increase x,- to min(x,(l + l/c,n), 1) for each 
machine / G P{i), and correspondingly increase yij by the effective capacity created in each machine. 
Further, we increase yk(j)j by (i/f]k(j] U)^- 

2.2 Analysis 

Our goal is to show the following bounds on the makespan and cost of the fractional schedule. 

Lemma 1. The fractional schedule produced by the online algorithm satisfies Y^jejyuP'J ~ 0{logm) for 
each machine i, andJ^ieM^i^i — 0{mlogm). 

We introduce a potential function 0, for machine / defined as 

c,a^'^\ if machine / is fully active, i.e. x, = 1 



^^ ' C.X, otherwise. 

The cumulative potential function ^ = Y.ieM'i'i- Our goal will be to show that ^ = 0{m\ogm). This will 
immediately imply Lemma [T] 



We prove the bound on ^ in three steps. First, we bound the increase of ^ in the pre-processing phase; 
next, we bound the increase of ^ in each algorithmic step (of either type A or type B); and finally, we bound 
the total number of algorithmic steps. 
Pre-processing. The next lemma bounds the increase of ^ in the pre-processing phase. 

Lemma 2. At the end of the pre-processing phase, ^ < m. 

Proof. The startup cost of each machine that is fully active after pre-processing is 1 ; on the other, every 
partially active machine / has c, < m and xi = 1/m after pre-processing. D 

Single Algorithmic Step. Now, we bound the increase in due to a single algorithmic step of either type. 

Lemma 3. The increase in potential in a single algorithmic step of type A is at most Ijn. 

Proof. The total increase in potential in an algorithmic step of type A (due to increase in the value of x, for 
machines / € P{f) U {^(j)}) is at most I,-eP(;)u{/t(y)} c/(^//c,-«) = (I,-eP(;)u{/tO)}-^()/« < 2/«. D 

Lemma 4. For any constant l<a< 13/12, the increase in potential in a single algorithmic step of type B 
is at most Ijn. 

Proof. The total increase in potential for machines / G P{f) due to an algorithmic step of type B is at most 
Y^i^p[i\C\(xil Cin) = {'^iep(j)Xi)/n < l/n. The other source of increase in potential is the scheduling of a 
fraction of job j to machine k{j), due of which the load on machine k{j) increases by 6/c^(j)a *(j)^'n. The 
resulting increase of potential (^^qj is 



c,(,.)a^^O)-i(^(l + (a-l))^/^*o-)^'*'^'"«-l) < c^j/^uy 



12(a-l) 1 

< -, 



Ck{j)a^''^J^ ^n n 



The penultimate inequality follows from the property that ( 1 +x) '-' < e^'^' < 1 + 2x/y, for any y>x>0. D 

Number of Algorithmic Steps. We classify the algorithmic steps according to a fixed optimal offline 
(integer) schedule that we call OPT. Suppose OPT assigns job j to machine 0Pt(7), and letMopx denote the 
machines that are active in the optimal offline schedule. The three categories are: 

1. 0PT(7) G P{j). 

2. OPt(j) ^ P{j) and OPt(7') is partially active. 

3. 0Pt(7) is fully active. 

The next lemma bounds the total increase in potential due to algorithmic steps in the first category above. 
Lemma 5. The total increase in potential due to all algorithmic steps in the first category is 0{mlogm). 



Proof. In any algorithmic step of the first category, the value of ■^opt(7) either increases to XopT(y) ( 1 + ^ . „ 
or to 1. Since Xi is initialized to at least 1/m for each machine / in the pre-processing phase, there are at 
most wi + LreMoPT *- '■" i°§ "^ ~ 0{mn\ogm) algorithmic steps of the first category. The lemma now follows 
fromLemmas|3]and|4] D 



The next lemma bounds the total increase in potential due to algorithmic steps in the second category. 

Lemma 6. TJie total increase in potential due to all algorithmic steps in the second category is 0{m). 

Proof. Consider the first 2cQpj^j\pQfjtj\jn algorithmic steps in the second category for any particular job / 
We have two cases: 

• Case 1: these algorithmic steps contain at least CopT(;)/'opT(y)y'^ steps of type B, or 

• Case 2: these algorithmic steps contain at least CopT(j)/'opT(y)y'^ steps of type A. 

In case 1, each algorithmic step of type B creates an effective capacity of 6/t]^q) {j)n in machine k{j). Since 
in each such algorithmic step, T\k(j){j) ^ Vopt{j)U) = '^opt{j)Popt{j)j^ the total effective capacity created by 
these algorithmic steps is at least 1. 

In case 2, let R{j) denote the set P{j) U {k{j)} for the last of these algorithmic steps. (Note that since 
OPt(j) ^ P{j), k{j) is defined.) Further, let x\ ' and x\ ' respectively denote the value of ;c, for machine / 
before the first algorithmic step for job j, and after the last algorithmic step. For each machine / G R{j), Xj 
has been increased in each of at least Copt(7)/'opt(7)7'J algorithmic steps of type A. Thus, for each machine 

ieR{j), 

^ OPT (j)l^ OPT (j)j 

since CopT(y)PopT{j)y ^ CiPij for all machines / G R{j)- The total effective capacity created in these steps is 

6(?'-^!^^) . 6xP)(l-2-P-0 ^ feP^ ^ (2) 
Pij Pij 2 

The first inequality follows from Eqn. [TT] while the second inequality follows from the observation that for 
any z < 1, we have (1 —2^^)/z > [1 — (1 — z + z^/2)]/z = 1 —z/2 > 1/2. Hence, the effective capacity 
created by these algorithmic steps is at least I^jeR(nxJ > 1. The lemma now follows from Lemmas |3] and 
IHcoupled with the fact that T,jeJ(^opj{j)PopjU)j < 2m. D 

Finally, we bound the total increase of potential due to algorithmic steps in the third category. 

Lemma 7. For any 1 < a < 13/12, the total increase in potential due to all algorithmic steps in the third 
category is at most 2 + (2/3) Y^ieMa <^;'^^' ^ where Li is the final load on machine i in the schedule produced 
by the algorithm and Ma is the set of machines that are fully activated by the algorithm. 

Proof. First, we consider an algorithmic step of type A. In such a step, k{i) must be defined since OPt(7) ^ P{j)', 
thus, 'LiePU)\j{k(j)}Xi > 1. Further, for each machine / e P{j) U {k{j)}, we have apij = r]i{j) < T]opt(;)(j)- 
Thus, the fraction of job j assigned in this algorithmic step is at least 

£ min( ^^,2x;) >min( 7^,1)' 

since T,ieP{j)u{k(j)}^i ^ 1- We first consider the situation where the whole of job j was assigned in this 
algorithmic step. In this case, the increase in potential due to job j is T,ieP{j)u{k(j)}{'^iXi)/{^i^) < 2/n; 
cumulatively for all jobs, such increases in potential add up to at most 2. Otherwise, the sum of increase in 




jij over all machines in this algorithmic step is at least ^/rJQpjijyn. Now, consider an algorithmic step of 
type B. Then the increase in yk{j)j is ^/^k{j)j^ ^ ^/^ovi{i)j^ since T\ovi(j)j ^ ^k{j)j- ^^ either case, the total 
number of algorithmic steps in the third category for job j is at most T]opT(y)y«/3. 

Note that r\ovY{i)i ^ '^opt(;)^^°"*^' VopT(y)y- Therefore, summing over all jobs, the total increase in 
potential due to algorithmic steps in the third category is bounded by (using Lemmas |3] and ID) 

jeJ iGMoptHMa 

where the penultimate inequality follows from the fact that the makespan of the optimal offline schedule is 
at most 1. n 

Finally, we bound the total potential 0; this immediately yields Lemma[T] 

Lemma 8. The online fractional algorithm produces a schedule that satisfies ^ = 0{mlogm). 

Proof. Lemmas 121 m [6] and |7] imply (p < 0{m\ogm) + (2/3)0, which proves the lemma. D 

3 The Online Randomized Rounding Procedure 

In this section, we give an online randomized rounding scheme for the fractional solution produced by the 
algorithm in the previous section. 

3.1 The Algorithm 

For each machine /, we select (offline) a number r, uniformly at random and independently from [0, 1]. In 
response to a new job j arriving online, the algorithm updates the schedule in thi^ee steps: 

• Fractional step. The fractional schedule is updated according to the algorithm described in the pre- 
vious section. 

• Activation step. Each inactive machine / that satisfies r, < 5xi{j)\n{mn) is activated. 

• Assignment step. Let Ma{j) be the set of active machines in the integer solution. Let Zij = yij/2xi{j) 
if Xi{j) < l/51n(m?i) and Zij = yij otherwise. Let qij be the normalized probability proportional to ztj 
in a distribution defined on the set of machines Ma{j). We assign job j to machine / with probability 

3.2 Analysis 

The next lemma is an immediate consequence of the fact that machine / is active in the integer solution with 
probability min(5A;,Tn(mn), 1). 

Lemma 9. The total startup cost of all machines activated in the integer schedule is 0{mlogm\og{mn)) in 
expectation. 



Proof. The expected startup cost of machine / in the integer schedule is at most 5c,x; In(m?2) ; the lemma now 
foUows from Lemma[I]and linearity of expectation. D 

To bound the makespan of the integer solution, we first bound the probabilities qij. (This lemma also shows 
that with high probability, at least one machine is active for even one job, thereby proving correctness of the 
algorithm.) 

Lemma 10. With probability at least 1 — \/m, qjj < Zijfor all machines i and jobs j. 

Proof. We show that Y^ii^Ma (j) ^'J — ^ ^^^^ probability at least 1 — 1 /mn for any job j; the lemma then follows 
using the union bound over all jobs. We classify machines into M\{j) and M2{j) depending on whether or 
not Xj (j) > 1 /5 In [mn). Every machine / G Mi (j) is also in Ma (j) . Therefore, the contribution of machines 
in Ml (7) to Y,ieMAU)^'J ^^ exactly Y,ieMi{j)yij- On the other hand, the contribution of each machine / ^M2{j) 
to this sum is yij/2xi{j) with probability 5xi{j)ln{mn), and otherwise. Define a random variable Z,-^ = 1 
with probability 5yijln{mn)/2, and otherwise. Since3',y/2;<;,(7) < 1, 






< 



L Z;y< £ yij 



1 
< , 

mn 



by Chemoff bounds (cf. e.g. Ull). D 

Lemma 11. The makespan of the integer schedule is O(logm) with probability 1 — 2/m. 

Proof. First, we prove that the load on any machine in the integer schedule is 0{logm + Y,jejyijPij) with 
probability at least 1 — l/m^ conditioned on the following: 

• Lemma [TOl holds . i.e. qjj < ztj for all machines / and jobs j, and 

• Machine / is active from the outset in the integer algorithm. 

Given that machine / is active from the outset and qij < ztj, the load on machine / due to job j is pij with 
probability at most z,y. Now, 






Let job / immediately precede job j in the online order; if j is the first job, Xi{j') = \/m. Then, 



jeJ:Xi{j)<\ 



"^ v.f 71 "^ I . . / •i\ ^A■} III ^A^ 



2^0- jeJ-MJXi '"'^J'^ jeJ-MJ)<i-^''='-^J'^ "" -^i/'" "" 



The first inequality follows from the fractional algorithm which assigns a fraction ytj < 6{xi{j) —Xi{f))/pij 
of job j to machine /. Since yijPij < pij < 1 for all jobs j, it follows using Chemoff bounds that the load on 
machine / is 0{logm + Y,jejyijPij) with probability at least 1 — 1/m^. 

Using the union bound over all machines and Lemma [TOl we can now claim that the load on machine / 
is 0(logm + T,jejyijPij) (unconditionally) for all machines /, with probability at least 1 — 2/m. The lemma 
now follows using Lemma [T] D 

Finally, we note that Lemmas l9l and [TT] imply Theorem [T] 
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